AITopics | Okayama

Collaborating Authors

Okayama

Language Model Tokenizers Introduce Unfairness Between Languages

Neural Information Processing SystemsFeb-14-2026, 13:54:12 GMT

Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different languages can have drastically different tok-enization lengths, with differences up to 15 times in some cases. These disparities persist even for tokenizers that are intentionally trained for multilingual support.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

North America > Haiti (0.14)
Asia > Philippines > Luzon > Ilocos Region > Province of Pangasinan (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
(38 more...)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

Add feedback

An Adaptive Resonance Theory-based Topological Clustering Algorithm with a Self-Adjusting Vigilance Parameter

Masuyama, Naoki, Toda, Yuichiro, Nojima, Yusuke, Ishibuchi, Hisao

arXiv.org Machine LearningDec-9-2025

Clustering in stationary and nonstationary settings, where data distributions remain static or evolve over time, requires models that can adapt to distributional shifts while preserving previously learned cluster structures. This paper proposes an Adaptive Resonance Theory (ART)-based topological clustering algorithm that autonomously adjusts its recalculation interval and vigilance threshold through a diversity-driven adaptation mechanism. This mechanism enables hyperparameter-free learning that maintains cluster stability and continuity in dynamic environments. Experiments on 24 real-world datasets demonstrate that the proposed algorithm outperforms state-of-the-art methods in both clustering performance and continual learning capability. These results highlight the effectiveness of the proposed parameter adaptation in mitigating catastrophic forgetting and maintaining consistent clustering in evolving data streams. Source code is available at https://github.com/Masuyama-lab/IDAT

algorithm, ami 0, threshold, (16 more...)

arXiv.org Machine Learning

2511.17983

Country:

Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.04)
Asia > Japan > Honshū > Kansai > Osaka Prefecture > Osaka (0.04)
North America > United States > California > Orange County > Irvine (0.04)
(3 more...)

Genre:

Research Report > New Finding (0.45)
Research Report > Promising Solution (0.34)

Industry:

Education > Educational Setting (0.67)
Leisure & Entertainment > Games > Computer Games (0.40)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Sub-exponential Growth of New Words and Names Online: A Piecewise Power-Law Model

Watanabe, Hayafumi

arXiv.org Artificial IntelligenceNov-12-2025

The diffusion of ideas and language in society has conventionally been described by S-shaped models, such as the logistic curve. However, the role of sub-exponential growth -- a slower-than-exponential pattern known in epidemiology -- has been largely overlooked in broader social phenomena. Here, we present a piecewise power-law model to characterize complex growth curves with a few parameters. We systematically analyzed a large-scale dataset of approximately one billion Japanese blog articles linked to Wikipedia vocabulary, and observed consistent patterns in web search trend data (English, Spanish, and Japanese). Our analysis of 2,963 items, selected for reliable estimation (e.g., sufficient duration/peak, monotonic growth), reveals that 1,625 (55%) diffusion patterns without abrupt level shifts were adequately described by one or two segments. For single-segment curves, we found that (i) the mode of the shape parameter $α$ was near 0.5, indicating prevalent sub-exponential growth; (ii) the peak diffusion scale is primarily determined by the growth rate $R$, with minor contributions from $α$ or the duration $T$; and (iii) $α$ showed a tendency to vary with the nature of the topic, being smaller for niche/local topics and larger for widely shared ones. Furthermore, a micro-behavioral model of outward (stranger) vs. inward (community) contact suggests that $α$ can be interpreted as an index of the preference for outward-oriented communication. These findings suggest that sub-exponential growth is a common pattern of social diffusion, and our model provides a practical framework for consistently describing, comparing, and interpreting complex and diverse growth curves.

correlation, large language model, machine learning, (22 more...)

arXiv.org Artificial Intelligence

2511.04106

Country:

Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.14)
Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.04)
Europe > France (0.04)
(5 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Media > News (1.00)
Consumer Products & Services (0.92)
Leisure & Entertainment (0.92)
(2 more...)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

74bb24dca8334adce292883b4b651eda-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 22:13:22 GMT

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > Haiti (0.14)
Asia > Philippines > Luzon > Ilocos Region > Province of Pangasinan (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
(38 more...)

Genre: Overview (0.46)

Technology:

Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
(2 more...)

Add feedback

Data Augmentation With Back translation for Low Resource languages: A case of English and Luganda

Kimera, Richard, Heo, Dongnyeong, Rim, Daniela N., Choi, Heeyoul

arXiv.org Artificial IntelligenceMay-6-2025

In this paper,we explore the application of Back translation (BT) as a semi-supervised technique to enhance Neural Machine Translation(NMT) models for the English-Luganda language pair, specifically addressing the challenges faced by low-resource languages. The purpose of our study is to demonstrate how BT can mitigate the scarcity of bilingual data by generating synthetic data from monolingual corpora. Our methodology involves developing custom NMT models using both publicly available and web-crawled data, and applying Iterative and Incremental Back translation techniques. We strategically select datasets for incremental back translation across multiple small datasets, which is a novel element of our approach. The results of our study show significant improvements, with translation performance for the English-Luganda pair exceeding previous benchmarks by more than 10 BLEU score units across all translation directions. Additionally, our evaluation incorporates comprehensive assessment metrics such as SacreBLEU, ChrF2, and TER, providing a nuanced understanding of translation quality. The conclusion drawn from our research confirms the efficacy of BT when strategically curated datasets are utilized, establishing new performance benchmarks and demonstrating the potential of BT in enhancing NMT models for low-resource languages.

artificial intelligence, machine translation, natural language, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3711542.3711594

2505.02463

Country:

Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.06)
Asia > South Korea (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry: Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Evolutionary Algorithms Approach For Search Based On Semantic Document Similarity

Muniyappa, Chandrashekar, Kim, Eujin

arXiv.org Artificial IntelligenceFeb-20-2025

Advancements in cloud computing and distributed computing have fostered research activities in Computer science. As a result, researchers have made significant progress in Neural Networks, Evolutionary Computing Algorithms like Genetic, and Differential evolution algorithms. These algorithms are used to develop clustering, recommendation, and question-and-answering systems using various text representation and similarity measurement techniques. In this research paper, Universal Sentence Encoder (USE) is used to capture the semantic similarity of text; And the transfer learning technique is used to apply Genetic Algorithm (GA) and Differential Evolution (DE) algorithms to search and retrieve relevant top N documents based on user query. The proposed approach is applied to the Stanford Question and Answer (SQuAD) Dataset to identify a user query. Finally, through experiments, we prove that text documents can be efficiently represented as sentence embedding vectors using USE to capture the semantic similarity, and by comparing the results of the Manhattan Distance, GA, and DE algorithms we prove that the evolutionary algorithms are good at finding the top N results than the traditional ranking approach.

algorithm, evolutionary algorithm, similarity, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3617733.3617753

2502.19437

Country:

North America > United States > North Dakota > Grand Forks County > Grand Forks (0.14)
Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Duong, Song, Bronnec, Florian Le, Allauzen, Alexandre, Guigue, Vincent, Lumbreras, Alberto, Soulier, Laure, Gallinari, Patrick

arXiv.org Artificial IntelligenceFeb-19-2025

Large Language Models (LLMs), when used for conditional text generation, often produce hallucinations, i.e., information that is unfaithful or not grounded in the input context. This issue arises in typical conditional text generation tasks, such as text summarization and data-to-text generation, where the goal is to produce fluent text based on contextual input. When fine-tuned on specific domains, LLMs struggle to provide faithful answers to a given context, often adding information or generating errors. One underlying cause of this issue is that LLMs rely on statistical patterns learned from their training data. This reliance can interfere with the model's ability to stay faithful to a provided context, leading to the generation of ungrounded information. We build upon this observation and introduce a novel self-supervised method for generating a training set of unfaithful samples. We then refine the model using a training process that encourages the generation of grounded outputs over unfaithful ones, drawing on preference-based training. Our approach leads to significantly more grounded text generation, outperforming existing self-supervised techniques in faithfulness, as evaluated through automatic metrics, LLM-based assessments, and human evaluations.

computational linguistic, conference paper, dataset, (14 more...)

arXiv.org Artificial Intelligence

2502.13674

Country:

North America > United States > South Carolina (0.04)
North America > Mexico (0.04)
North America > Dominican Republic (0.04)
(19 more...)

Genre:

Research Report > New Finding (0.92)
Research Report > Experimental Study (0.68)

Industry:

Health & Medicine (1.00)
Consumer Products & Services > Restaurants (0.93)
Leisure & Entertainment > Sports (0.93)
Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Long-term prediction of El Ni\~no-Southern Oscillation using reservoir computing with data-driven realtime filter

Jinno, Takuya, Mitsui, Takahito, Nakai, Kengo, Saiki, Yoshitaka, Yoneda, Tsuyoshi

arXiv.org Artificial IntelligenceJan-29-2025

In recent years, the application of machine learning approaches to time-series forecasting of climate dynamical phenomena has become increasingly active. It is known that applying a band-pass filter to a time-series data is a key to obtaining a high-quality data-driven model. Here, to obtain longer-term predictability of machine learning models, we introduce a new type of band-pass filter. It can be applied to realtime operational prediction workflows since it relies solely on past time series. We combine the filter with reservoir computing, which is a machine-learning technique that employs a data-driven dynamical system. As an application, we predict the multi-year dynamics of the El Ni\~no-Southern Oscillation with the prediction horizon of 24 months using only past time series.

artificial intelligence, machine learning, prediction, (13 more...)

arXiv.org Artificial Intelligence

2501.17781

Country:

Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.04)
Pacific Ocean (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Continual Self-supervised Learning Considering Medical Domain Knowledge in Chest CT Images

Tasai, Ren, Li, Guang, Togo, Ren, Tang, Minghui, Yoshimura, Takaaki, Sugimori, Hiroyuki, Hirata, Kenji, Ogawa, Takahiro, Kudo, Kohsuke, Haseyama, Miki

arXiv.org Artificial IntelligenceJan-7-2025

We propose a novel continual self-supervised learning method (CSSL) considering medical domain knowledge in chest CT images. Our approach addresses the challenge of sequential learning by effectively capturing the relationship between previously learned knowledge and new information at different stages. By incorporating an enhanced DER into CSSL and maintaining both diversity and representativeness within the rehearsal buffer of DER, the risk of data interference during pretraining is reduced, enabling the model to learn more richer and robust feature representations. In addition, we incorporate a mixup strategy and feature distillation to further enhance the model's ability to learn meaningful representations. We validate our method using chest CT images obtained under two different imaging conditions, demonstrating superior performance compared to state-of-the-art methods.

artificial intelligence, machine learning, representation, (15 more...)

arXiv.org Artificial Intelligence

2501.04217

Country:

Africa > Togo (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Asia > Japan > Hokkaidō (0.04)
(5 more...)

Genre: Research Report > Promising Solution (0.48)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.94)

Add feedback

Improved ICNN-LSTM Model Classification Based on Attitude Sensor Data for Hazardous State Assessment of Magnetic Adhesion Climbing Wall Robots

Ma, Zhen, Xu, He, Dou, Jielong, Qin, Yi, Zhang, Xueyu

arXiv.org Artificial IntelligenceDec-29-2024

Magnetic adhesion tracked climbing robots are widely utilized in high-altitude inspection, welding, and cleaning tasks due to their ability to perform various operations against gravity on vertical or inclined walls. However, during operation, the robot may experience overturning torque caused by its own weight and load, which can lead to the detachment of magnetic plates and subsequently pose safety risks. This paper proposes an improved ICNN-LSTM network classification method based on Micro-Electro-Mechanical Systems (MEMS) attitude sensor data for real-time monitoring and assessment of hazardous states in magnetic adhesion tracked climbing robots. Firstly, a data acquisition strategy for attitude sensors capable of capturing minute vibrations is designed. Secondly, a feature extraction and classification model combining an Improved Convolutional Neural Network (ICNN) with a Long Short-Term Memory (LSTM) network is proposed. Experimental validation demonstrates that the proposed minute vibration sensing method achieves significant results, and the proposed classification model consistently exhibits high accuracy compared to other models. The research findings provide effective technical support for the safe operation of climbing robots

artificial intelligence, machine learning, robot, (16 more...)

arXiv.org Artificial Intelligence

2412.20675

Country:

Asia > China > Heilongjiang Province > Harbin (0.04)
Europe > France > Bourgogne-Franche-Comté > Doubs > Besançon (0.04)
Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.46)
Transportation (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback